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Abstract 

We propose a notion of contraction function for a family of graphs and establish its con¬ 
nection to the strong spatial mixing for spin systems. More specifically, we show that for 
anti-ferromagnetic Potts model on families of graphs characterized by a specific contraction 
function, the model exhibits strong spatial mixing, and if further the graphs exhibit certain 
local sparsity which are very natural and easy to satisfy by typical sparse graphs, then we also 
have FPTAS for computing the partition function. 

This new characterization of strong spatial mixing of multi-spin system does not require 
maximum degree of the graphs to be bounded, but instead it relates the decay of correlation 
of the model to a notion of effective average degree measured by the contraction of a function 
on the family of graphs. It also generalizes other notion of effective average degree which may 
determine the strong spatial mixing, such as the connective constant [SSY13, SSSY15], whose 
connection to strong spatial mixing is only known for very simple models and is not extendable 
to general spin systems. 

As direct consequences: (1) we obtain FPTAS for the partition function of g-state anti¬ 
ferromagnetic Potts model with activity 0 < f3 < 1 on graphs of maximum degree bounded by 
A when q > 3(1 — (3) A + 1, improving the previous best bound j3 > 3(1 — (3) A [LY13] and 
asymptotically approaching the inapproximability threshold q = (1 — (3) A [GvV13j; and (2) we 
obtain an efficient sampler (in the same sense of fully polynomial-time almost uniform sampler, 
FPAUS) for the Potts model on Erdos-Renyi random graph Q(n,d/n) with sufficiently large 
constant d, provided that q > 3(1 — /3)d + 4. In particular when /3 = 0, the sampler becomes 
an FPAUS for for proper g-coloring in Q(n,d/n) with q > 3d 4- 4, improving the current best 
bound q > 5.5 d for FPAUS for g-coloring in Q{n,d/n) [ ft 14a]. 


1 Introduction 

Spin systems are idealized models for local interactions with statistical behavior. In Computer 
Science, spin systems are widely used as a model for counting and sampling problems. The Potts 
model is a class of spin systems parameterized by the number of spin states q > 2 and an activity 
j3 > 0. Given an undirected graph G = (V, E), a configuration is a cr E [</]' that assigns each 
vertex in the graph one of the q states in [q]. Every configuration a is assigned by the model 
with weight wc(o') := n c = u ^G E /3 1 ( <T ^ =cr ^ • A probability distribution over all configurations, 
called the Gibbs measure , can be naturally defined as /u(cr) = WG ^ a ' ) where the normalizing factor 
Z = Z{G) = YlaG[q] v wg(&) is the partition function in statistical physics. When (3 < 1 the 
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interacting neighbors favor disagreeing spin states over agreeing ones, and the model is said to be 
anti-ferromagnetic. 

The partition function gives a general formulation of counting problems on graphs, whose exact 
computation is #P-hard. For example when /3 = 0, the partition function Z{G) gives the number 
of proper (/-colorings of graph G. There is a substantial body of works on the approximate counting 
of proper (/-colorings of graphs in the context of rapid mixing of a generic random walk, called the 
Glauber dynamics[ Jcr95, BD97, VigOO, DF03, Mol04, HV03, HV06, Hay03, DFHV04]. 

An exciting accomplishment in recent years is in relating the approximability of partition func¬ 
tion for spin systems to the phase transition of the model on the infinite regular trees, also known 
as Bethe lattices. Here the exact phase transition property we are concerned with is the presence 
of the decay of correlation , also called the spatial mixing : assuming arbitrary possible boundary 
conditions on all vertices at distance l from the root in the regular tree, the error for the marginal 
distribution at the root measured by the total variation distance goes to 0 as i —>• oo. The de¬ 
cay of correlation of the model on the infinite A-regular tree undergoes phase transition as the 
activity parameter /3 crosses the critical threshold in terms of q and A, called the uniqueness 
threshold as it corresponds to the transition threshold for the uniqueness of Gibbs measures on the 
infinite regular tree. For proper q-colorings, the uniqueness condition for Gibbs measure on the 
d-regular tree is q > A + 1 [Jon02], and for anti-ferromagnetic Potts model with 0 < j3 < 1, the 
uniqueness threshold on the A-regular tree is conjectured to be q = (1 — /3)d which was proved 
to be the threshold for the uniqueness of semi-translation invariant Gibbs measures on d-regular 
tree [GvV13]. For anti-ferromagnetic 2-spin systems (where q = 2), it was settled through a series 
of works [WciOG, SST12, Y13, SlylO, GvV12, SS12] that the transition of approximability of the 
partition function for the model on graphs of bounded maximum degree is precisely characterized 
by the phase transition of the model on regular trees. 

For multi-spin systems (where q > 3), in a seminal work of Gamarnik and Katz [GK12], the 
decay of correlation was used to give deterministic FPTAS for counting proper g-colorings of graphs 
and for computing the partition function of general multi-spin systems. This is the first determin¬ 
istic approximation algorithm for multi-spin systems and also one of the first few deterministic 
approximation algorithms for ^P-hard counting problems. The specific notion of decay of correla¬ 
tion established in [ K 12] is a stronger one, namely the strong spatial mixing , where the correlation 
decay is required to hold even conditioning on an arbitrary configuration partially specified on a 
subset of vertices. Later in [ 3], the strong spatial mixing was established for proper q- 

colorings on graphs of maximum degree bounded by A when q > aA + 1 for a > a* ~ 1.763.., 
which is best bound known for strong spatial mixing for colorings of graphs of bounded maximum 
degree. When the parameters of the model are in the nonuniqueness regime, there is long-range 
correlation. In [GvV13], this was used to establish the inapproximability of the partition func¬ 
tion for anti-ferromagnetic Potts model on graphs with maximum degree beyond the uniqueness 
threshold. 

In this paper, we are interested in the spatial mixing and FPTAS for spin systems on graphs 
with unbounded maximum degree. For some special 2-spin systems such as the hardcore model 
and Ising model with zero field, this was achieved by relating the decay of correlation property to 
a notion of effective degree of a family of graphs, called connective constant. Roughly speaking, 
the connective constant of a family Q of graphs is bounded by A if for all graphs G from Q the 
number of self-avoiding walks in G of length i starting from any vertex v is bounded by n°^ A^. 
In [ 1Y13, SSSY 15], the exact reliance on the maximum degree by the correlation decay and FPTAS 
for the hardcore model in [ 7ei06] was replaced by that on the connective constant. However, as 
pointed out in [SSS ], the approach does not extend to general 2-spin systems. For multi¬ 
spin systems, the cases for unbounded maximum degree was studied in the literature mostly in 
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the context of sampling proper (/-colorings of Erdos-Renyi graphs Q(n,d/n ) with constant average 
degree d [DFFV06, ES08, MS10, Eftlda, Eftl2, Eftl4b], 

Our goals are to establish strong spatial mixing and to give FPTAS for multi-spin systems on 
families of general graphs with unbounded maximum degree that are not restricted to Q(n,d/n). 
It turns out that for multi-spin systems, in order to achieve these goals we need a more robust way 
than connective constant to measure the effective average degree. We illustrate the necessity of 
this by a class of bad instances of graphs called caterpillars, which were also considered in [ in 14] 
and [ ]. The caterpillars as in Figure 1 are paths P = (v\,V 2 , - ■ ■ ,v n ) with each v t adjoined 

with k bristles. Consider in particular the proper (/-colorings. For the caterpillars with k > q — 2, 
the vertices on bristles can always be fixed in a way to force the vertices on the path to alternate 
between two colors, which means the strong spatial mixing does not hold on any caterpillar with 
a k > q — 2. On the other hand, the connective constant of the family of all caterpillars is only 1. 
This simple example shows how different multi-spin systems can be from the Ising model on graphs 
of unbounded maximum degree. It also suggests that for multi-spin systems we should take into 
account both the branching factor (which measures the long-range growth of self-avoiding walks) 
and the actual degrees of individual vertices (which locally affect the decay of correlation) when 
studying the decay of correlation on graphs of unbounded maximum degree. 


vi v 2 v 3 v 4 v 5 

Figure 1: A caterpillar with n = 6, k = 3 and q = 



5. 


1.1 Contributions 

In this paper, we prove strong spatial mixing and give FPTAS for (/-state anti-ferromagnetic Potts 
model and in particular the g-colorings, for families of sparse graphs with unbounded maximum 
degree. We achieve this by relating strong spatial mixing on a family of graphs to the function 
which is contracting on the graphs from the family, a notion that generalizes the connective con¬ 
stant [SSY13, SSS'i ] and properly measures the effective average degree that affects the decay of 
correlation in general spin systems. 

As connective constant, it is convenient to talk about contraction function on infinite graphs. 
Given a vertex v in a locally finite infinite graph G(V,E), let SAW(u,I?) denote the set of self¬ 
avoiding walks in G of length t starting at v. The function <5 : PI —>• M + is a contraction function 
for graph G if 


sup lim sup £s (v, £) = 0 
vEV £—^oo 


i 

where £s(v, £) := e n <5(deg (vf)), 

(v,Vi,...,v e ) i =1 
gSAW(o,Z) 


This definition can be naturally extended to a family Q of finite graphs, in such a way that <5(-) is 
contracting for Q if £$(v, £) is of exponential decay in i for all G € Q and any vertex v in G (see 
Section 2). 

We can use the contraction function d(-) to describe various families Q of graphs. For example, 
the families Q of graphs with maximum degree bounded by A can be described precisely by the 
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contraction function such that 5(d) = i if d < A and 5(d) = oo if otherwise. The contraction 
function also gives a more robust way than the connective constant to capture average degrees. 


Proposition 1. The families of graphs G with connective constant bounded strictly by A are pre¬ 
cisely the families G for which the constant function 5(d) = ^ is a contraction function. 

The FPTAS in [ ] for the hardcore model with activity A on the families of graphs with 

bounded connective constant is actually an FPTAS for any family G of graphs for which a critically 
defined 5(-) is a contraction function. 1 It was proved in [SSSY ] that 5(d) < for all d with the 
equality holds precisely at the critical threshold d = A c , therefore with the notion of contraction 
function we observe that the FPTAS in [ ] works on strictly broader families of graphs than 

what was guaranteed in [ 1SY15] with the connective constant. 

Our first result relates the decay of correlation (in the sense of strong spatial mixing) of the 
anti-ferromagnetic Potts model on a family G of general graphs to its contraction function. 


Theorem 2 (Main theorem: strong spatial mixing). Let q > 3 be an integer and 0 < (3 < 1. Let 
G be a family of finite graphs that satisfy the followings: 

• the following <5(-) is a contraction function for Q: 


5(d) 


• 2 ( 1 — P) 
q-l-(l-p)d 


ifd< fEj-2, 

otherwise; 


( 1 ) 


• (proper q-coloring) if (3 = 0, the family Q also needs to be q-colorable. 

Then the q-state Potts model with activity /3 exhibits strong spatial mixing on all graphs in G ■ 

Remark. The contraction function describes a notion of average degree and the theorem holds for 
graphs with unbounded maximum degree. For example, for Erdos-Renyi random graph G(n,d/n), 
which with high probability has constant average degree (1 — o(l))d and unbounded maximum 
degree 0(log nj log log n), assuming q > 3(1 — /3)d + 0(1) with high probability the above 5(-) is 
a contraction function for G(n,d/n). Note that q = (1 — (3c) A is the uniqueness/nonuniqueness 
threshold for semi-translation invariant Gibbs measures of Potts model on the infinite A-regular 
tree Ta, which is also conjectured to be the uniqueness/nonuniqueness threshold [GvV13]. 

In order for the SSM to imply an FPTAS for computing the partition function, we need the 
graphs to be sparse in a slightly more restrictive manner than what guaranteed by the above 
contraction function. The conditions of (1) classify the vertices in a graph into low-degree vertices 
(if deg (u) < — 2) and high-degree vertices (if otherwise). A graph G = (V,E) is said to be 

locally sparse if for every path P in G of length £, the total size of clusters of high-degree vertices 
growing from path P is bounded by 0(1+ log |R|) (see Section 2 for a formal definition). Intuitively, 
a graph is locally sparse if all clusters of high-degree vertices are small (of size 0(log |Vj)) and are 
relatively far away from each other. 

Theorem 3 (Main theorem: approximate counting). Let q > 3 be an integer and 0 < (3 < 1. Let 
Q be a family of locally sparse graphs satisfying the conditions of Theorem 2. Then there is an 
FPTAS for the partition function of the q-state Potts model with activity (3 for all graphs in G 

1 This <5(-) is defined formally as follows: 5(d) = de _1 (^, where x is the unique positive solution to 
dx = A(1 + x)~ d + 1 and p = 2 — (A c — 1) In ^1 + Ac 1 _ 1 ^ where A c = A C (A) is the critical (real) degree satisfying 

\ _ A * c 
A ~ (A c -l)4c + l ' 
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Remark. This is the first FPTAS for the general Potts model on families of graphs of unbounded 
maximum degree. And even for graphs with bounded maximum degree A Theorem 3 holds with a 
much better bound q > 3(1— /3)A+1, which greatly improves the best previous bound 3(1— (3)A < /3 
known for anti-ferromagnetic Potts model on graphs with bounded maximum degree [LY13]. Recall 
that q = (1 — P c )A is the semi-translation invariant uniqueness threshold on the infinite d-regular 
tree Ta, and the problem is hard to approximate for all even q satisfying q < (1 — /3)A [GvV13]. 

To evaluate how tight Theorem 2 and 3 could be, we instead consider an idealized goal: to find 
a <5(-) (depending only on the model) such that the strong spatial mixing rate with respect to any 
v in a graph G is always bounded by £s(v,£). With this stronger requirement, for (/-coloring, the 
following 5(-) is the best we can hope for: 


8(d) 


q—d—1 

l 


if d < q — 2 
otherwise 


because it is achieved by the SSM rate on caterpillars as illustrated in Figure 1 when the colors of 
leaves are fixed properly. This certainly gives a lower bound to any contraction function for the 
above idealized goal. Note that the 5(-) function in (1) in the case of (/-coloring (when /3 = 0) is 
precisely twice this lower bound function. And this factor 2 is due to an intrinsic obstacle in the 
current approaches for correlation-decay based algorithms for multi-spin systems. 

As an application of Theorem 3 we consider the Erdos-Renyi random graph Q(n,d/n) with 
constant average degree d. It is well known that the partition function of the Potts model on 
this random graph is highly concentrated to its expectation which is easy to calculate [COV13], so 
for this model sampling is more interesting than counting. And because the FPTAS in Theorem 3 
actually works for a broader self-reducible family of instances (for example, the list-colorings instead 
of just g-colorings), we also have efficient sampling algorithms due to the standard Jerrum-Valiant- 
Vazirani reduction [ ]. So we prove the following result. 

Theorem 4. Let d be sufficiently large and G ~ Q(n,d/n). Let 0 < /3 < 1 and q > 3(1 — (3)d + 4. 
There exists an algorithm S such that for any e > 0 with high probability S returns a random 
configuration in [q] v ^ G ' ) from a distribution that is within total variation distance e from the Gibbs 
distribution \ig for the q-state Potts model with activity /?. And the running time of S is in 
polynomial in n and log j. When @ = 0, i.e. for q > 3d + 4, with high probability S is an FPAUS 
(fully polynomial-time almost uniform sampler) for proper q-colorings of G. 

This is the first result for sampling in general Potts model on Q(n,d/n) and also improves 
the state of the arts for the special case of sampling proper g-colorings of Q(n,d/n). For this 
special case of the problem, FPAUSes have been obtained mostly by the rapidly mixing of certain 
block Glauber dynamics [DFFV06, ES08, MS10, Eftl4a]. The best bound q > 5.5 d was achieved 
in [ la], whereas our bound for proper (/-coloring is q > 3d + 4. Further, the results of [ V06, 

ES08, MS10, Eftl4a] are more restricted to Q(n,d/n). In contrast, our algorithm is more generic 
and works for all families of locally sparse graphs with a bounded growth rate of self-avoiding walks 
measured properly by the contraction function. 

In [ it 12, Eft 1 11)], sampling algorithms with a much weaker control of total variation errors 
than FPAUS were considered, and with this weaker sampler a better bound q > (1 + e)d was 
achieved, which almost approaches the uniqueness threshold. 


1.2 Techniques 

The approximation algorithms for multi-spin systems (e.g. graph coloring) have been studied in 
the literature in the context of rapid mixing of Glauber dynamics. In a seminal work of Gamarnik 
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and Katz [ ], the correlation-decay based deterministic algorithms for multi-spin systems are 

introduced. Our analysis of decay of correlation utilizes the approach of [ ] in an essential 

manner. 

Consider for example the proper ^-colorings of graph G. If there is a vertex v in G with degree 
much higher than q , then the Glauber dynamics will have torpid mixing around v since the color of 
v will be frozen at most of the time; and the bound on the decay of correlation also breaks at this 
high-degree vertex because locally it may have absolute correlation with the neighbors. Further, 
for graph coloring with maximum degree unbounded, even the feasibility becomes an issue. These 
bad situations were dealt with in [ FV06, MS10, Eft 14a] by using block dynamics, where the 

block contains the high-degree vertices as its core which is separated from the boundary by a buffer 
of small-degree vertices. As the bound on q getting tighter, the constructions of such blocks have 
to be highly delicate to meet this requirement. 

The novelty in our approach is that we give a correlation-decay based deterministic algorithm 
that works in terms of blocks. A key observation of us is that despite the correlation can be absolute 
between a pair of high-degree vertices within the same block, its contribution to the decay rate 
along a self-avoiding walk is at most a factor 1 (hence the 5(d) = 1 branch for large d in (1)), 
while the low-degree vertices at the boundaries of blocks contribute to the decay of correlation 
as in the bounded degree case. While this observation is made to the original Gibbs measure, 
algorithmically it can be witnessed by applying the recursion in terms of marginal distributions on 
blocks. In contrast to the block construction in [ FFV06, ES08, MS10, Eft 14a], the blocks in our 
algorithm are extremely simple and generic: they are just clusters of high-degree vertices. This 
simple construction of blocks makes our algorithm more generic and works on general graphs. 

In a previous work [Yinl4], this idea of block version of decay of correlation was used to establish 
a “spatial mixing only” result for random graphs, augmented from the result of Gamarnik et 
al. [ IK Ml 3] for graphs of bounded maximum degree. The current work emphasizes the strong 
spatial mixings that have algorithmic implications. Here the property of being locally sparse is 
used to bound the running time. 

2 Preliminaries 

Let G = ( V., E) be an undirected graph. For any subset S C V of vertice, let GfS 1 ] denote the 
subgraph of G induced by S, and let dB = {u € V \ B \ 3w € B, (it, w) € E} denote the vertex 
boundary of B. Given a vertex v in G, let distc (v, S ) denote the minimum distance from v to any 
vertex u £ S in G. 

Potts model and spatial mixing. The Potts model is parameterized by an integer q > 2 and 
a real (3 > 0 called the activity parameter. Each element of [q] is called a color or a state. Let 
G = (V, E) be a graph. A configuration cr £ [g] A on a subset A C V of vertices assigns each vertex 
v in A one of the q colors in [q] . In the Potts model on graph G, each configuration a £ [q] ^ is 
assigned a weight 

w G (a) = 

where f mon((j) = | {(u, v) € E \ cr(u) = o’(i’)} | gives the number of monochromatic (undirected) 
edges in the configuration a. 

In order to study strong spatial mixing, we consider the instances of Potts model with boundary 
conditions. An instance of Potts model is a tuple Q = (G, A, a) where G = (V , E) is an undirected 
graph, A C V is a subset of vertices in G and cr £ [q] A is a configuration on A. Given such an 
instance H = (G, A, cr), the weight function wq assigns each configuration 7r £ [q]^ the weight 
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wq{tt) = wg{tt) if vr agrees with a over all vertices in A, and wq(tt) = 0 if otherwise. An instance 
Gl is feasible if there exists a configuration on V with positive weight. This gives rise to a nature 
probability distribution over all configurations on V for a feasible Potts instance: 


Pro [c(V) = 7r] 


wq(tt) 
Z{Gl) ’ 


where Z(Gl) = w ^( a ) i s ca ^ th e partition function. This probability distribution is called 

the Gibbs measure. For a vertex v € V and any color x € [g], we use Pro [c(u) = x\ to denote the 
marginal probability that v is assigned color x by a configuration sampled from the Gibbs measure. 
Similarly, for a set S C V and 7r 6 [g] , we use Pro [c(S) = 7r] to denote the marginal probability 
that S is assigned configuration 7r by a configuration sampled from the Gibbs measure. Next we 
define the notion of strong spatial mixing. 

Definition 5 (strong spatial mixing). Let Q be a family of graphs. We say that the g-state 
Potts model with activity /3 exhibits strong spatial mixing on all graphs in Q if there exist positive 
constants C > 0, 7 < 1 such that for every G = (V. E ) € G, every v € V, A C V, x G [g], and any 
configurations < 7 , r € [g] A that Qi = (G, A, a) and GI 2 = {G, A, r) are both feasible, it holds that 

|Proj [c(u) = x] - Pro 2 [c(v) = x}\ < n c 7 dlst cG> A ), 


where n = \ V\ and A C A is the set of vertices on which a and r differ. 

If we change distc (v, A) to distg (v, A) in the definition it becomes the definition of weak spatial 
mixing. 


Permissive block and locally sparse. Fix any q > 2 and 0 < f5 < 1. Let fl = (G,A,a) be an 
instance of g-state Potts model with activity /3 and v a vertex in G. We call v a low-degree vertex 
if deg G (u) < ~ 2, and otherwise we call it a high-degree vertex. 

Definition 6 (permissive block). Let Gl = (G, A, a) be a Potts instance where G = (V,E). A 
vertex set B C V \ A is a permissive block in Gl if every boundary vertex u € dB \ A is a low-degree 
vertex. For any subset of vertices S C V \ A, we denote B(S) = Bq(S) the minimal permissive 
block containing S. And we write B(v) = B(S) if S' = {u} is a singleton. 

Definition 7. A family Q of hnite graphs is locally sparse if there exists a constant C > 0 such that 
for every G = (V, E) in the family and every path P in G of length t we have \B(P)\ < C(£- Flog |P|). 

Feasibility and local feasibility. Let = (G, A, a) where G = (V. E) be an instance of Potts 
model, v € V \ A be a vertex. For a subset of vertices S C V \ A, a configuration 7r e [q] s is 
(globally) feasible if there exists a configuration on V with positive weight and agrees with 7r on S. 
A configuration 7r £ [g] 5 is locally feasible , if tc G [aus] U it) > 0, where a U 7r is the configuration 
over A U S that agrees with both a and ir. 

The disucssion of feasibility and local feasibility is meaningful only when (3 = 0. In this case, 
the local feasibility of a configuration on a permissive block implies the (global) feasibility. 

Proposition 8. Let Gl = (G. A, a) where G = (V, E) be an feasible instance of Potts model with 
f3 = 0,v£V\Abea vertex and it € [qbe a locally feasible configuration. Then 7 r is also 
feasible. 
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Proof. Denote B = B(v). Fix a configuration rj E [q] v such that wn(rj) > 0, this is possible since Q 
is feasible. We denote by rj the restriction of 77 to V \ ((B U dB ) \ A), i.e., the set of vertices that 
are either in A, or not in B U dB. 

Consider the configuration 77 = n U rj E [<?]' . it can be extended to a configuration 

p E [ 5 ]' with vun(p) > 0 in a greedy fashion, since every vertex in dB \ A is of low-degree. Thus p 
witness that 7 r is feasible. □ 

With this proposition, we do not distinguish between local feasibility and feasiblity of configu¬ 
rations on permissive blocks. For a permissive block B, we use J-(B ) to denote the set of feasible 
configuration. Note that when f3 > 0, the set E(B) is simply [q] B . 

Self-avoiding walk tree. Given a graph G = (V, E ) and a vertex v E V, a rooted tree T can 
be naturally constructed from all self-avoiding walks starting from v in G as follows: Each vertex 
in T corresponds to a self-avoiding walk (simple path in G) P = (v,v 1 , t> 2 ,..., Vk) starting from v, 
whose children correspond to all self-avoiding walks (v, vi, v %,..., v^. v ^ + \) in G extending P, and 
the root of T corresponds to the trivial walk (v). The resulting tree, denoted by Tsaw (G,v), is 
called the self-avoiding walk (SAW) tree constructed from vertex v in graph G. 

From this construction, every vertex in Tsaw (G, v) can be naturally identified with the vertex 
in V (many-to-one) at which the corresponding self-avoiding walk ends. 

Connective constant and contraction function. Given a vertex v in a locally finite graph 
G(V,E), let SA\N(v,£) denote the set of self-avoiding walks in G of length l starting at v. The 
following notion of connective constant of families of finite graphs is introduced in [ !Y13]. 

Definition 9 (connective constant [SSY13, SSSY15]). Let Q be a family of finite graphs. The 
connective constant of Q is bounded by A if there exists a positive constant C > 0 such that for 
any graph G = (V, E) in Q and any vertex v in G, we have |SAW(u,£)| < n c A £ where n = \V\ for 

all t> 1 . 

Let S : N —» M + be a function. Given a vertex v in a locally finite graph G(V,E), let 

t 

£s(v,£) := e n 5(deg (vi)). 

( v,Vi,...,V£ ) i =1 

eSAWO/) 

Definition 10 (contraction function). Let Q be a family of finite graphs. The <5 : N —>• is a 

contraction function for Q if there exist positive constants C > 0, 7 < 1 such that for any graph 
G = (V, E) in Q and any vertex v in G, we have £s(v,£) < n c 7 ^ where n = \V\ for all £ > 1. 

It is easy to see that graph families Q with constant contraction function 5(d) = ^ are precisely 
the families Q of connective constant bounded strictly by A. 

3 Recursion 

In this section, we introduce recursions to compute the marginal probability on a vertex and on a 
permissive block in Potts model respectively. 

Let Q = (G, A, a) where G = (V, E) be an instance of Potts model and v G V \ A be a vertex. 
Let B = B(v) be the minimal permissive block containing v. Let SB = {uiVi \ i € [m]} be an 
enumeration of boundary edges of B where Uj fL B for every i € [m]. In this notation, more than 


one Ui or ft may refer to the same vertex. We denote E(B ) := {uv G E \ u,v € B} the edges in B. 
We use B to denote the inner boundary of B , i.e., B = {uGB\uvGE and v 0 B}. 

Recall that we use E(B) to denote the set of feasible configurations on a permissive block B, it 
is easy to see that, for every x G [q], 

Prn [c(v) = x] = Pro \c{B) = i r]. 

7r(v)=X 


This identity relates the marginal probability on a vertex to marginal probabilities on a block. We 
now define notations for some sub-instances and give a block-to-vertices identity. 

Let 7T G E(B) be a configuration on a permissive block B. For every i G [m], denote ft = 7r(ft). 
Let Gb = (Vb, Eb) denote the graph obtained from G by removing B\B and edges in E(B), i.e., 
V' = (V \ B) U B, E' = E \ E(B). Let VLb = (Gb, A, <r). For every i = 1, 2,. .. , m + 1, define 
= (Gf, A| r , <x| r ) as the instance obtained from 12 b by fixing Uj to color 7Tj for every j G [i — 1] 
and by removing edges UjVj for every j = i, i + 1 ,... , m. 

Lemma 11. Assuming above notations, it holds that 


Prn [c(B) = 7r] 


WG[B\(.n) • mu ( x - ( x - /?) pr w = ^0 

EpeT(B) w G[B]{P ) ' mil (!-(!- £) Pr nf [c(ft) = ft]) 


( 2 ) 


Proof. 


Pr^ [c(R) = 7r] 


WG[B](tt) • Z (Ki+l) 

^2p£T(B) w G[B]{p) ' ^(^m+l) 


Since for every p G E(B) and i G [d], 


^+i) = E z l c K) = y) • 0 1(l,=p(tti)) 

2/e[?] 


where Z (fF* | c(ft) = y) stands for the sum of the weights of all feasible configurations a on 
satisfying cr(ft) = y and l(-) is the indicator function. With this identity, we can further write 


Prn [c(B) = 7r] = 


wg [ b \( a ) ■ n;=i 


£ w 6 r ,,z(n* | c{vi)= y )-p^y=^i)) 

z{ W) 


m S,6fal z ( n ? l c C p «)=y)-' 8l(,=p(l ‘ <)) 


SpeJ-(B) W G.B (p) • Hi 1 ' Z(Of) 

^G[B]0) • n:=i (!-(!- /3)Prw t c (^) = ^]) 


Epe^(B) w G[B](p) • rn=i (!-(!- ^) Pr nf [ c ( v i) = Pi 


□ 
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This identity expresses the marginal probability on a permissive block as the function of marginal 
probabilities on its incident vertices, with modified instances. We now analyze the derivatives of 
this function. 


Lemma 12. Let p = ,P = (Pi,p)ie[m], P eT(B) be two tu P les of variables and 

,, . _ w g[b] (tt) nSi (1 - (1 - P)Pi,*) 

np) " e^b) w g [ b \( p ) n;=i a - (i - pm ' 

Assume for every i G [m],p G F(B(v)), Pi, p ,Pi, p < q _^_^ d . , then 

|log/(p) - log/(p)| < V- 5 1 f] -t • max |log p ip - logp i)P | . 

fzf, Q~{ 1 ~ P)<k - 1 P &[q] B{v) 


Proof. For every i G [to], we have 


g/ 

dpi^TT 

For every i G [to] and p / 7 r, we have 


(1-/?)/(!-/)• 


1 

1 - (1 - (3)pi j7r ' 


df , , w G[B] (p) rn=i(! - u - fiKp) _ i 

jr(B) w G [b](o-) n:=i(! - u - p)Pi,°) i - (i - /3)Pi, P ' 


Thus, 


E 

PG-F(S) 

p^TT 


df 

dpi.p 


<(1 -/?)/(!-/)• 


1 

max - 7 -—-. 

pgJ-(B) 1 - (1 - /3)pi )P 

p^TT 


Let = 7 , by mean value theorem, for some p = {Pi, P )i e [m], P e[q] B ^'> wh ere each p ijP < , 

we have 


I log / Cp) — log /(p) | 



df 

\ 

dpi, P 

) 


• I log Pi,p 


log Pi ,p 


sE 

i€[m] 


/ 

£Cf) 

$(Pi,7r) 

V 


a/ 

dpi^-ir 


+ E 

P&HB) 

pA™ 


Hf) 
$(p*,p) 


\ 

a/ 

dpi.p 


max | log Pi,p 
p&[q] B (P 


log Pi,p 


SE 

ie[m] 


(1-/3) 


Pi, 7\ 


1 - (1 - £)pt 


+ max 


Pi,p 


pgJ-(B) 1 - (1 - P)pi, p 

p^TT 


< v- _ 2 (1 - /3) _ 


max | log pjn 

pg[q]SW 


logPi, p 


max | log pi iP - log pi,p 

pe[q\ B M 


□ 


The following lemma gives an upper bound for the probability Pr^ [c(u) = x\. 
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Lemma 13. Assume q > (1 — f3)d. For every color x E [q], it holds that 


Pr n [c(u) 


X] - q~( 


where d is the degree of v in G. 

Proof. Assume x = 1. For every i € [g], let Xi denote the number of neighbors of v that are of color 
i. Then p v j < max 1 ^ x . subject to the constraints that all x ? ; are nonnegative integers and 

Xi = d. Since B < 1, we can assume x\ = 0, thus p v i < max . We now distinguish 

’ l +l^i=2P 1 

between two cases: 

1. (If d > q — 1) In this case, let A = 1 — (5, then 

1 1 <9 1 _ 1 

!+Ef= 2 @ Xi ~ 1 + (q- 1)(1 - A)Wi _ 1 + (q - 1) ^1 - Q ~ i 1 ~ P) d ’ 

where W is due to the fact that the inequality (1 — a) b >l — ab holds when 0 < a < 1 and 

b> 1. 

2. (If d < q — 1) In this case, due to the integral constraint of xfs, the term E ?=2 P Xi minimizes 
when d of xfs are set to one and remaining xfs are set to zero. Therefore, we have 

1 i _ i 

1 + Y?i= 2 P Xi ~ 1 + dfi + (q - 1 - d) q-(l-P)d' 


□ 


The recursion (2) holds for arbitrary set of vertices B (not necessary a permissive block), thus if 
one takes B as a single vertex, it implies the following simple lower bound for marginal probabilities 
on a vertex. 


Lemma 14. For every feasible x € [g], it holds that 


Prn [c(v) 



where d is the degree of v in G. 


4 Strong Spatial Mixing 


We prove the strong spatial mixing property for Potts model in this section. Recall that 

2(l-/3) 

)d 

otherwise. 


5(d) = l -fi)d 


Theorem 2 is restated in a formal way: 

Theorem 15. Let q > 3 be an integer and 0 < /3 < 1. Let Q be a family of finite graphs that satisfy 
the followings: 

• the function 5(-) is a contraction function for Q; 
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• (proper q-coloring) if /3 = 0, then Q is a family of q-colorable graphs. 

Then there exist two constants C \, C 2 > 0 such that the following holds: For every graph G(V,E) € 
Q with | V| = n, every vertex v E V, every color x € [q\, every set of vertices A C V \ {v} and two 
feasible instances IR = (G,A,p), SR = (G, A, tt) with p,i r£ [g] A being two configurations on A, it 
holds that 

|Pr Ql [c(u) = x] - Prn 2 [c(u) = cc]| < n Cl • exp (• -C 2 ■ £), 
where £ := dist(u, A) and AC A is the subset of A on which p and n differ. 

We prove the theorem by using the recursion introduced in Section 3 to estimate marginal 
probability on a vertex v. In each step, we show that the difference between the (logarithm of) 
marginal probabilities caused by different configurations on A contracts by a factor of <5(-), and 
therefore relate the difference of marginal probabilities to the contraction function. 

The following observation is useful: If <5(-) is a contraction function for Q , then for every graph 
G £ Q, every sufficiently long path in G must contain a low-degree vertex. The property is formally 
stated as: 

Lemma 16. Let Q be a family of finite graphs for which S(-) is a contraction function. Then for 
some constants 6 > 1 and C > 0, for every G = (V, E) € G with \V\ = n, every v € V and 
every L > Clogn, there exists a low-degree S in T = Tsaw(G, v) such that for every u € S, 
L < dist^ (u,v) < 6L and every self-avoiding walk in T from v of length 6L intersects S. 

Proof. Let G(V, E) € Q be a graph. It follows from the definition of contraction function that for 
some constant C > 0 , for every l > Clogn, £$(v,£) < a 1 for some constant 0 < a < 1 . 

It is sufficient to show that, for some constant integer 6 > 0 it holds that for every v € V, every 
L > Clogn, every P = (v,v\,..., vql) & SAW (v,9L), there exists a low-degree vertex Vj among 
{VL+1,V0L, ■ ■ ■, vbl}- 

Let 9 = max j flog^ ( afi-g) )"! > 2 j. Assume for the contradiction that every vertex in 
{vl + i, vl+ 2 , ■ ■ ■ ,V 0 l} has high-degree. Since 9L > L > Clogn, we have nf=i A(deg (v{)) < a eL . 

On the other hand, since 6(d) > (5(0) = , we have nt=i <5(deg (vf)) > • This is 

a contradiction for our choice of 8. □ 

4.1 The (3 > 0 case 

To implement the recursion introduced in Section 3, we define two procedures marg(fl, v, x,£) and 
marg-block(ll, B(v), 1 t,£) calling each other to estimate vertex and block marginal respectively. We 
assume 0 = (G, A, a) where G = {V. E ) is a feasible instance of Potts model, v € V \ A is a vertex, 
x € [q] is a color and £ is an integer. Recall that for a permissive block B(v), we use J~(B ) to 
denote the set of feasible configurations over B(v). 

Algorithm 1: marg(f2, v, x, £) 

1 If v is fixed to be color y. then return 1 if x = y and return 0 if x 7 ^ y; 

2 If £ < 0 return 1/q; 

3 Compute B(v)] 

4 For every p E lF(B(v)), let p p <— marg-block(0, B(v), p, £); 

5 Return min <j EneT(B(v)) Pn, m ax{i, g -(i-/3)deg G R)} 

I s.t. n(v)=x 
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To describe the algorithm for estimating the block marginals, we need to introduce some nota¬ 
tions. Let B = B(v), and we enumerate the boundary edges in 5B by e t = mvi for i = 1,2,..., m, 
where Vi 0 B. With this notation more than one u t or v l may refer to the same vertex, which is 
fine. For every i G [m] and p G J-(B), define Qb and Gf as in Lemma 11. 

Let Pi = (v,wi,W 2 , ■ ■ ■ ,Wk,Vi) be a self-avoiding walk from v to Vi such that all intermediate 
vertices Wi are in B(v). Since B(v ) is a minimal permissive block, such walk always exists, and let 
Pi be an arbitrary one of them if there are multiple ones. 

Algorithm 2: marg-block(G, B(v),ir,£) 


1 Compute Pi for every i € [m]; 

2 p itP G- marVi, pi, £ - \Pi\ ) for every i € [m] and p G E(B); 


3 Return 


WG[B}(K)Y[ ie [ m \p-^-P)Pi,-n) 
I2p£T(B) W G[B] ( P ) nie[m]P-p-^Pi, P ) ’ 


We need a few dehnitions to analyze the two procedures. 


Definition 17. Given an instance G = (G, A, a) of Potts model where G = ( V,E), a vertex 

v € V \ A, a color x € [(/] and an integer l. The computation tree of marg(fl, v, x, i), denoted by 
CT(Q,v,x,£), is a rooted tree recursively defined as follows: 

• The root of CT(G, v, x, l) is labeled (Q,v,x,£)', 

• For every recursive call to (ft', v', x', £') by marg(G, v, x, l) (in the subroutine marg-block), 
(Q,v,x,£) has a children which is the computation tree of (IT, v', x', £'). 

Define the termination set of marg(D, v, x, £) as the set of vertices u in the self-avoiding walk tree 
Tsaw {G\V \ A], v) that marg(fy, u, x', £') returns at step 2 for some leaf (fT, u, x', £') of CT(D, v, x, £). 

Thus the computation of marg(D, v, x, £) ends either at trivial instance (including vertex with 
fixed color and one-vertex graph), or at vertices in termination set. 


Definition 18. Given an instance G = (G, A, a) of Potts model with q > 3 and activity 0 < /3 < 1 
where G = (V, E) with |P| = n, a vertex v € V\ A. Let T = Tsaw (G[V \ A], v) be the self-avoiding 
walk tree rooted at v in G[V \ A] and S' be a set of low-degree vertices in T. Assume v has m 
children v\, V 2 , ■ ■ ■, v m in T, let T % denote the subtree of T rooted at p. We recursively define the 


error function: 


£t,s ■= 


YT= l 5 ( de gG ( * * v i)) • 

q + n log- 5 . 


if v(£S, 
otherwise. 


Definition 19. Given an instance G = (G,A,a) of Potts model where G = ( V,E ), a vertex 
v € V \ A, a color x € [g], an assignment n € P(B(v)) and an integer £. We denote pn,v/(x) = 
marg(Q,v,x,£) and = marg-block(G, B(v), tt, £). Define 


£nA v ) '■= max |log (psi, v ,e(x)) ~ log (Prn [c(u) = x])| ; 

x&[q] 

SnA B ( v )) '■= “ax |log (f>n,s(D,t( 7r )) “ lo g ( Pr o [c(B(v)) = tt])| . 

neS(B(v)) 

We use the convention that log 0 — log 0 = 0. 


The following key lemma relates the error functions we introduced above. 
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Lemma 20. Let Cl = (G,A,a) 
where G = (V,E). Assume V 


be an instance of Potts model with q > 3 and activity 0 < f3 < 1 
= n. Let v eV\ A be a vertex and x G [q] be a color. Let L > 0 


be an integer and S be the termination set of mar g(Cl,v, x,L). Denote T = Ts aw [G[V \ A], vj as 
the self-avoiding walk tree rooted at v in G\V \ A]. Then £^ L (y) < £f §. 


Proof. Let CT = CT(Cl,v,x, L). For every vertex (Ll,v,z,£) in CT where Q = (G = (V,E),A,a), 
we apply induction on the depth of CT(Ll,v,z,£) to show 


£n,i( v ) E ^T,SrV(T) ’ 

where T = Tsaw {G\V \ A],u) and V(T) is the set of vertices in T. 

The base case is that marg (Ll,v,z,£) is itself a leaf, namely it returns without any further 
recursive call to marg. Then 

• if it is returned at step 1, £q/(v) = 0; 


• if it is returned at step 5, £ci,e( v ) = 0; 

• if it is returned at step 2, due to Lemma 14, £q/(v) < q + deg G (v) log Jj. 

Assume the lemma holds for smaller depth and marg(J), v, z, £) is not a leaf. In Algorithm 1, 
the estimation of marginal is computed as: 


Pn,vA z ) = min < 


^ ' P^L ) B{v\t (^")i r-1 

max {1, g — (1 — /?)deg G (u)} 

< S.t. 7 t(v)=Z 


By Lemma 13, it always holds that Pro [c(r) = z\ < max {i i g_(ii / 3 ) degG ^)} • Thus assuming 
Pn,v,e( z ) = YlneT^BM) Pn,B(v),e( n ) will n ot make the error £t,s(v) smaller, and hence we have 

S.t. 7 t(v)=Z 


= max |l°g ( Pr n [ C <A) = x ])~ log Pn,v,i( x )\ 

xe[q] 


= max 
xe[q\ 


( 


log 


\ 


Pr o i c ( B (A) 


= 7T 


. 7TeJ(BW) 
\s.t. n(v)=x 


- log 




\ 


yi pq.,b(v)A' k ) 


. 7TeJ(B(»)) 

\S.t. 7r(v)=X 




< max (log (Pro [cr{B(v)) = tt]) - log (po.B^M) | 

7r€.F(£(v)) 

= 5o,f(5(r)). 


where the last inequality is due to that for every positive ai,..., a n , b ±,..., b n , S ie[7l] < max ie u T. 

2 -/ie[n] 0i 1 J 

Since marg (Ll,v,z,£) is not a leaf, the value of pci, v /( z ) I s returned at step 5 of Algorithm 1, 
and it is computed from the recursion in Algorithm 2. Recall 5B(v ) = {uiVi \ i G [m]}. For every 
i G [m], we let £{ = l — |Pj|. We claim that Lemma 12 implies 

< £ 2(1 ~ m 


max 


ie\m\ 


£ 

ie[m] 


q ~ (1 - /?)deg G (vi)-l P eB(B(v)) 


log (Pr Q p [<r(vi) = p*]) - log (pnewAiPi)) 


2 ( 1 - 0 ) 


q - deg G (vi) - 1 P €T{B(v)) 


max £ n P k (vi), 

~r~f r>t~.\\ i. ’ L 
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where fl? is obtained from 12 as in Lemma 12. 

To see this, note all Vi is on the boundary of a permissive block, thus either ,£-\Pi\( v i) = 0 
for every p (in case that the color of Vi is fixed), or by Lemma 13, 

Pr Q p [c(u») = Pi] < -7 z -wwT-7-^r- 

1 q ~ (1 - /?)deg G [yi) 

Also from step 5 of Algorithm 1, we have 

1 

,Vi,£i\Pi) — T\ n\ i 7 T* 

•’ ’ q - (1 - P)deg G (vi) 

With this upper bound for £ci,e{ v )^ and note that every P % is a self-avoiding walk from v to Vi 
with every vertex in B(v), we can then apply the induction hypothesis to complete the proof. □ 

We are now ready to prove the main theorem of this section. 

Proof of Theorem 15 when j3 > 0. By the definition of the strong spatial mixing, it is sufficient to 
prove the theorem for £ = 11 (log n). Let C and 0 be the constants in Lemma 16 and assume 
£ > 9\C\ogn\ be an integer. Let L = \£/9\ > Clogn. Consider pn ltV ,L (x) = marg(12i, v, x, L). Let 
S denote the termination set of marg(12i, v, x, L). By our choice of L, Lemma 16 implies that the 
set S satisfies 


1. every vertex in S is of distance (L, 2 L\ to u in T = Tsaw (G[V \ A], v), i.e., L < disty (v, u) < 
6L for every u E S; 

2. every path from v to A in T intersects S. 

It follows from Lemma 20 that 

I log (pn u v,l( x )) - log (Prn, [c(v) = x])| < Sn liL (v) < S T ,s, 


and similarly 

I log ( Pn 2 ,v,U x )) _ log ( Pr o 2 [ c (t) = A)\ < £u. 2 ,l{v) < £ t,s- 

On the otherhand, we have 

/ i \ eL 

£t,s < ( q + deg G (v) log - I • ^ £s(v, k) < n c l • exp {-C' 2 ■ £) 

' P' k=L +1 


for some constants C [, C' 2 > 0. 

Note that by the second property of S , pn ltV , x ,L = Pn 2 ,v,x,Li thus 

|Pr Ql [c(u) =x]~ Pro 2 [c(u) = x]| < |Pr Ql [c(u) = x\ - pn 1>v ,L( x ) I + l Pr o 2 [c(v) = A - p n 1 iV ,l(x)\ 

< n Cl ■ exp (—C '2 • £) 

for some constants C\, C 2 > 0. □ 
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4.2 The (3 = 0 case (Coloring model) 


Since the lower bound for marginal probability in Lemma 14 is zero for (3 = 0, the quantity £t,s 
defined in Definition 18 is no longer bounded above. We slightly modify the procedure marg to deal 
with this case. 

Let D = (G, A, a) be an instance of Potts model with q > 3 and activity (3 = 0 where G = ( V ., E), 
v £ V \ A be a vertex, x £ [g] be a color and £ be an integer. We define 

Algorithm 3: marg(D, v, x, £) 

1 If v is fixed to be color y. then return 1 if x = y and return 0 if x T y; 

2 Compute B(v)] 

3 If £ < 0, then return l/q if there is a feasible n £ T(B(v)) such that ir(v) = x and return 0 if 
no such 7r exists; 

4 For every p £ T(B(v)), let p p •£- marg-block(D, B(v), p, i)\ 

5 Return min \ E^(BR)) Pn, m ax{i. g -(i-/3)d e g G (,)} 

I s.t. n(v)=x 

The only difference of this version of marg is at step , where we check whether the color x is 
locally feasible. We return l/q if so and return 0 otherwise. 

Let T = Tsaw {G\V \ A], v) be the self-avoiding walk tree rooted at v in G[R\ A]. With our new 
version of marg, define the computation tree CT = CT (f2, v. x, £) the same as in Definition 17, while 
the termination set of marg(D, v, x,£) is defined as the set of vertices u in T that maxg(Q',u,x',£') 
returns at step 5 in Algorithm 3 for some leaf x',£') of CT- 

We can similarly define error functions as the (3 > 0 case, with difference on the base case. 


Definition 21. Given an instance D = (G, A, a) of Potts model with q > 3 and activity (3 = 0 
where G = (V. E) with \ V\ = n, a vertex v € V\ A. Let T = Tsaw (G[V \ A], v) be the self-avoiding 
walk tree rooted at v in G[V \ A] and S be a set of low-degree vertices in T. Assume v has m 
children v\, V 2 , ■ ■ ■, v m in T, let T % denote the subtree of T rooted at Tj. We recursively define the 


error function: 


£t,s ■= 


E™ 1 <5(deg G (Vi)) • S Ti ,s 
n log q. 


if v <(L S, 
otherwise. 


Lemma 22. Let Ci = 

G = (V,E). Assume 


G, A, a) be an instance of Potts model with q > 3 and activity (3 = 0 where 
V = n. Let v € V \ A be a vertex and x £ [q] be a color. Let L > 0 be 


an integer and S be the termination set o/marg(D,h, x , L). Denote T = Tsaw \ A], vj as the 

self-avoiding walk tree rooted hat v in G[V \ A]. Then £^ L {y) < £f 


Proof. The proof of this lemma is almost identical to the proof of Lemma 20, except at the base 
case of the induction. 

Consider the situation that (fl,v,z,£) is a leaf in CT = CT(Cl, v, x, L)), then 


• if it is returned at step 1, then £q/(v) = 0; 


• if it is returned at step 5, then £n/(v) = 0; 


• if it is returned at step 5, then we claim that £n/(v) < nlogg. To see this, it is sufficient 
to show that if there is a feasible 7r £ T(B(y)) such that n(v) = x, then Pro [c(u) = x\ > 0, 
and if no such 7r exists, then Pro [c(v) = x\ = 0 (We use the convention that log 0 — log 0 = 0 
and the fact that Pro [c(u) = x] > 0 implies Pro [c(v) = x\> q ~ n ). This is a consequence of 
Proposition 8 
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□ 


Proof of Theorem 15 when (3 = 0. With Lemma 20 replaced by Lemma 22, the proof is almost 
identical to the (3 > 0 case. □ 


5 Approximate Counting and Sampling 

In this section, we prove Theorem 3. We first show how to estimate the marginal probability in 
Potts model and it is routine to obtain FPTAS from this estimation. 


5.1 Estimate the marginals 

Theorem 23. Let q > 3 he an integer and 0 < (3 < 1. Let Q be a family of finite graphs that 
satisfies the followings: 

• the function 5(-) is a contraction function for Q; 

• (proper q-coloring) if (3 = 0, the family Q is q-colorable; 

• the family Q is locally sparse. 

Then for every feasible instance P = (G, A, a) of Potts model where G = (V,E) € Q with |E| = n, 
ACE and a G [g] A , for every vertex v G V and every color x G [q], there exists an algorithm that 
can compute an estimation p of Pr^ [c(u) = x\ in time polynomial in n, satisfying 


1-0 



< 


P 

Prn [c(r>) = x] 


<1 + 0 



Let Q be a family of finite graphs satisfying condition in Theorem 15. Let P = (G,A,<j) be an 
instance of Potts model where G = (V,E) G Q. Then for every vertex v G V, color x G [q] . set of 
vertices A C V \ {A U {w}} and a feasible configuration p 6 [q] A , we have shown in the proof of 
Theorem 15 that we can compute an estimate p of Pro [c(w) = x | c(A) = p\ such that, for some 
universal constants 01,02 > 0, 


I log p - log (Pro [c(u) = x I c(A) = p])| < n Cl ■ exp (-0 2 • l ), 


where i = distc (v, A), as long as £ > 01og?r for some constant C > 0. 

To prove Theorem 23, we show that if Q is locally sparse, then our estimation algorithm is also 
efficient, i.e., terminates in polynomial time for L = 0(log |E|). 


Lemma 24. Let q >3 be an integer and 0 < (3 < 1. Assume Q is a family of graphs satisfying 
condition in Theorem 23. Then there exists a constant C > 0 such that for every feasible instance 


P = (G 
Glog i 
0 ( 1 ) 


V 


(V,E),A, a) with G G Q, every vertex v G V \ A, every color x G [q] and every L > 
the procedure marg(P, v, x, L) (both (3 > 0 and (3 = 0 versions) terminates in time 


V 


exp (O(L)). 


Proof. Let C and 0 be the constants in Lemma 16. Fix S as the termination set of marg(P, v, x, L ). 
Let T = Tsaw (G[V\A], v y Then it follows from Lemma 16 that L < dist^, S^j < 6L. Let 
CT = CT(Ll,v,x, L) denote the computation tree of marg(P, v, x, L). 
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For every (Q,v,z,£) € CT, where 12 = (G(V, E), A, cr), consider the self-avoiding walk tree T 
that is obtained from Tsaw (G\V \ A], v) by removing all descendants of S. We use Vci,z to denote 


the set of self-avoic 
We claim that 


ing walks corresponding to the leaves of T. 


V , 




= exp(0(L)). To see this, note that 


V, 


Cl t v 


< J2k=i SAW^h, k), where 


SAWa(0, k ) is the set of self-avoiding walks of length k from v in G. On the other hand, we have 


V 


O(i) 


6L 

> y^£s(v,k) > 

k=1 


2(1 - (3) \ 9L 

q ~ 1 ' k =l 


SAW Mv,k), 


where the last inequality is due to 5(d) > for every d > 0. 

The time cost of each vertex marg (Cl, v, z, £) in CT besides the recursive calls is at most C' ■ 
mq B n(v) f or som e constant C' > 0 where m = |5i?Q(n)| is the size of edge boundary of B^(v). We 
use rn jV to denote the maximum running time of marg (Cl,v,z,£) over all colors z £ [g]. We apply 
induction on the depth of CT(marg(f2, v, z, £)) to show that i\i jV < C' Yl,p&v n „ (T^“ 6P . If the 

depth of CT(Cl,v, z,£) is one, the upper bound is trivial. Now assume the lemma holds for smaller 
depth. Denote B = Bq(v) and assume 5B = {uiVi \ i € [m]} be the edge boundary of B. Then 


m,v < V' Y m?,Vi + C' • mg |B| < g |B| Y] ( C' + max r^, Vi 

ir£p(B) i€[m] i€[m] 

Applying the induction hypothesis, we have for some it € T(B) 

t 


2 U ueP B w ( u ) 


m,v < c' • g |B| Y 1 + Y q 
ie[m] y PeVn? lVi 

< c'Y Y q 2 (l u “ epBn i r(u) l +|B| ) 

ie[m] PeV^ir 

X ’ 1 

< C’ ■ Y g 2 |UueP S n( u )| ; 

P&Pct.v 


where the last inequality is due to the following three facts: 


1 . each path in Vm yVi I s a P ar t °f some path in Va >v , and all these paths in Vn,v are distinct; 

2. B n = 0 for every i, u and 7r; 

3. B^(u) is at least as large as B^(u) for every u. 

Therefore, we have 


t q v — G' ■ Y g 2 l Uu6pBn(u) l 

P&Vn,v 


V 


0 ( 1 ) 

exp ( Q(L )) 


□ 


Proof of Theorem 23. Assume \V\ = n. Let C and 9 be the constants in Lemma 16. Fix some 
L > Clogn and denote S the termination set of marg(12, v, x, L). Let pn,v,L(x) = marg(12, v, x, L) 
and T = Tsaw (G[V \ A], v). Then it follows from Lemma 20 and Lemma 22 that 

I log (psi,v,l(x)) ~ log (Prn [c(u) = x])| < S T ,s- 
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We also have 


GL 

£t,s < n°^ ■ ^ £s(v, k ) < n Cl ■ exp (— C 2 ■ L) 
k=L -\-1 


for some universal constants C\. C 2 > 0. 

Thus for some L = O(logn) and L > Glogn, it holds that 


1-0 



Pn,v,s v (a) .. n ( J_\ 

~ Pro [c(v) = x] ~ \n 3 J ' 


The running time of the algorithm directly follows from Lemma 24. 


□ 


5.2 The sampling algorithm 

Theorem 25. Let Q be a family of graphs satisfying the conditions in Theorem 23. There exists an 
FPTAS to compute the partition function of Potts model with parameter q and [3 for every graph 
in Q. 


Proof. Let Q = (G, 0,0) be an instance of Potts model, where G(V,E ) e Q. Without loss of 
generality, we give an algorithm to compute an approximation of the partition function Z(Q) 
satisfying 


1-0 



< zm 

- z(n) 


<1 + 0 



Since our family of instances of Potts model is “self-embeddable” in the sense of [ -1 89], the algo¬ 
rithm can be boosted into an FPTAS. 

Assume V = {ui,... ,v n }. First find a configuration a £ [g] v such that wg(&) > 0. This task 
is trivial when /3 > 0. When (3 = 0 , since G is g-colorable, we can also do it in polynomial time: 


• If the graph is not empty, then choose a vertex v and find a feasible coloring of B{y). Then 
remove B(v ) from the graph and repeat the process. 

If G is g-colorable, then G[V \ B(y)\ is colorable as the boundary of B(v) consists of low-degree 
vertices, thus the above process will end with a proper coloring of G, which is the union of colorings 
found at each step. The process terminates in polynomial time since Q is locally sparse and thus 
the size of every B(v) is O(logra). 

With a in hand, we have 


Z(Q) = wc{cr )/Pr^ [c(V) 


a] = w G (a) 



A c (^) 

_i =1 




-1 


= w G (cr) ^JJPro 


c(vi) 


&(Vi) 


i —1 


A c ( v i) 

3= 1 


crivj) 


-1 


For every i € [n], let II,; = (G, A,,crj) where A, = {vi,..., u,_i} and cri(vj ) = cr(vj) for every 
j = 1, ..., i — 1. We have 


z(n) = w G (a) n pr ^ t c (^) 

\i=1 
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Note that the graph class Q is closed under the operation of fixing some vertex to a specific color, 
we can apply Theorem 23 for every f \ and obtain p^ such that 


1-0 



< 


_ Pi. 

Pro, [ c(vi ) 


°(vi)} 


<1 + 0 



Let Z(fl) = vjg((j) In'Ll Pi) \ then Theorem 23 implies that 


1-0 



< /m 

- z(n) 


<1 + 0 



□ 


Our approximate counting algorithm implies a sampling algorithm via Jerrum-Valiant-Vazirani 
reduction [ V86]. 

Corollary 26. Let q > 2 and 0 < P < 1 be two constants. For a family of graphs Q satisfying 
conditions in Theorem 23, and every graph G(V,E ) € Q with \V\ = n, there exists an algorithm S 
such that for any e > 0 with high probability S returns a random configuration in [q/' ^ from a 
distribution that is within total variation distance e from the Gibbs distribution pc f or the q-state 
Potts model with activity (3. And the running time of S is in polynomial in n and log -. When 
P = 0, i.e. for q > 3d + 4, with high probability S is an FPAUS (fully polynomial-time almost 
uniform sampler) for proper q-colorings of G. 


6 Random Graphs 

In this section, we prove Theorem 4. We first prove the following properties of Q(n, d/n). 

Theorem 27. Let d be a sufficiently large constant, q > 3(1 — ft) + 4 and G = ( V, E) ~ G{n, d/n). 
Then with probability 1 — o(l), the following holds 

• there exist two universal positive constants C > 0 ,7 < 1 such that £$(v,£) < n c rf for all 
v € V and for all £ = o(y/n); 

• if p = 0, then G is q-colorable; 

• there exists a universal constant C > 0 such that for every path P in G of length l, |L?(P)| < 
C{£ + log n). 

Note that the first property in above theorem impose an upper bound on l. This is not harmful 
as our algorithms for FPTAS and sampling only require the property holds for £ = O(logn). Thus 
Theorem 27 and Corollary 26 together imply Theorem 4. 

It is well-known that when P = 0, G is g-colorable with high probability (see e.g., [ 75]), we 

verify the first property in Lemma 28 and the third property in Lemma 30. 

6.1 Correlation decay in random graphs 

Lemma 28. Let d > 1, 0 < P <1 and q > 3(1 — P)d + 4 be constants. Let G(V,E ) ~ Q{n,d/n). 
There exist two positive constants C > 0 and 7 < 1 such that with probability 1 — 0 for every 
v € V and every £ = o(y/n), it holds that 

£s(v,£) < n c 
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We first prove a technical lemma. 


Lemma 29. Let 0 < (3 < 1 be a constant. Let f q (d) : M-° —>• M-° be a piece wise function defined 


as 


2(1 ~ft) 


ifd< 0-2 


«<0 ~ \ p 1 -' 1 -™ , 

I 1 otherwise. 

Let X be a random variable distributed according to binomial distribution Bin(n, where A > 1 
is a constant. Then for q > 3(1 — /3 )A + 2 and all sufficiently large n, it holds that E [f q (X)\ < . 

Proof. Let A = 1 — fi. Since f(d) is decreasing in q. we can assume q = 3AA + 2. Note that 


E [/M]<x 

d~Bin(n,A) ^ 


e [i - mi > AA. 

d~Bin(n,-§) ^ 


Let g(x) := 1 — f(x), then 


E [1 ~f(d)]= Y S{k)-p(k) 

d~Bin(n,f) k=0 


where p(k) = (£) (%) k (l - £) 
Define 


A\ n ~ k 


g(x) := 1 - 


2A 


2A 2 (x —A) 2A 3 (x — A) 2 2A 4 (x —A)" 


g-l-AA (g — 1 — AA) 2 (g-l-AA) 3 (q - 1 - AA) 4 
2A 5 (x — A) 4 2A 6 (x — A) 5 2A 6 (x — A) 6 


(q — 1 — AA) 5 (q — 1 — AA) 6 (q — 1 — AA) 6 


Then 


g(x) - g(x) = 


2A 6 (g — 1 — A — xA)(x — A) 6 
(g — 1 — x\)(q — 1 — AA) 6 


which is positive for x < — 2j. 

We now prove that 




Y g(k)-p(k ) 


> 


A - 1 


k=0 


A 


The expectation of <)(&) can be computed directly: 

1 


E \g(k)\ = 


n 5 (q — 1 — AA) 6 


(C^n 5 + C 4 ri l ± 0(n 3 )) , 


where 


C 5 = 1 - 2A + (12A - 20A 2 - 2A 3 - 2A 4 - 2A 5 - 4A 6 )A 

+ (60A 2 - 80A 3 - 12A 4 - 14A 5 - 74A 6 )A 2 + (160A 3 - 160A 4 - 24A 5 - 50A 6 )A 3 
+ (240A 4 - 160A 5 - 16A 6 )A 4 + (192A 5 - 64A 6 )A 5 + 64A 6 A 6 ; 

C 4 = 2A 3 (1 + 3A + 7A 2 + 46A 3 )A 2 + 2A 3 (6A + 18A 2 + 234A 3 )A 3 
+ 2A 3 (12A 2 + 69A 3 )A 4 + 16A 6 A 5 . 
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Since C 4 > 0, thus for sufficiently large to, it holds that 


E \g(x)] > 


C 5 

(q- 1 - AA) 6 ' 


We also have that 


L^-sJ 


E [g{x)]= Y S{k)-p{k)+ Y g{k)-p{k) 
k=0 fc=l £ x i -ij 

It can be verified that g(x) is monotonically decreasing in x when x > — 2 and g — 2^ = 


1+2A(A—1) \ 0 n 

1+2AA ) ^ u ‘ 

Thus we have 


L^-21 


Y 9{k) ■ p{k) > E [g{x)j > 


C 5 


k =0 


A - 1 


(q - 1 - AA ) 6 A 


+ h(A) 


where 


h{ A) = (1 + 10AA + (40A 2 - 2A 3 - 2A 4 - 2A 5 - 4A 6 )A 2 

+(80A 3 - 12A 4 - 14A 5 - 74A 6 )A 3 + (80A 4 - 24A 5 - 50A 6 )A 4 ) 

+(32A 5 - 16A 6 )A 5 ) • (A(l + 2AA) 6 )" 1 . 

It can be verified that h{ A) is positive for every 0 < A < 1 and A > 1. □ 

Proof of Lemma 28. Let v € V be arbitrary fixed and T v = Tsaw (G, v) and t > 0 be an integer. 
By linearity of expectation, we have 


E \£s{v,l)] < n l 



■ t 

n^(degG («»)) 

. 1=1 


P = (v, vi,..., vi) is a path 


Fix a tuple P = (v, v \,..., vf). To calculate the expectation, we construct an independent sequence 
whose product dominates n,-i ^(degg- (vf)) as follows. 

Conditioning on P = (v, v\, ..., vf) being a path in G. Let X\ , A 2 ,..., be random variables 
such that each X,; represents the number of edges between V{ and vertices in F\{ui,... ,vg }; and let 
Y be a random variable representing the number of edges between vertices in {v \,..., vi} except for 
the edges in the path P = (v, v\, ..., vg). Then X\, ..., Xg, Y are mutually independent binomial 
random variables with each X* distributed according to Bin(n — £, and Y distributed according 
to Bin(( 2 ) — £ + 1, ^), and for each Vi in the path we have deg G (vf) = X, + 2 + 1) with some 
Y\ +Y-2 + ■ ■ ■ + Yg = 2Y. 

Note that <5(deg G (vi)) = f q { deg G (u*)) where the function f q (x) is defined in Lemma 29. Note 
that the ratio f q {x)/f q {x — 1) is always upper bounded by 2, and we have f q (x + 1) < f q -i(x). 
Thus, conditioning on that P = (v, ..., Vg) is a path, the product n!=i ^g,g(^ e SG ( v i )) can be 

bounded as follows: 


n 5(deg G (Vi)) = n f q (Xi + Yi + 2) < 2 2y ]J / g _ 2 (X i ). 


i =1 


i =1 


i =1 
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Let d! = , then we have dl > d. Let X be a binomial random variable distributed according 

to Bin(n, +), thus X probabilistically dominates every X* whose distribution is Bin (n — £, £). Since 
X\, X 2 , ■ ■ ■, Xp, Y are mutually independent conditioning on P = (v, v \,..., vf) being a path in G, 
for any P = (v, v \,..., vp) we have 


E 


n ^( de SG ( v i)) 


. 2=1 


P is a path 


< E 




ru-2M 


2=1 


<E[4 y ]E [f q -2(X)Y 


Recall that Y ~ Bin ^( 2 ) — £ + 1, , the expectation E [4 1 ] 


can be bounded as 


P 


E [4 r ] < ^4 A 


k =0 


t 2 \{d\ k f_ dP f2 - k 


k j \ n 


1 - 


n 


= |1 +“) S«q> 


3 dr 


n 


Since q — 2 > 3(1 — fd)d' + 2, it follows from Lemma 29 that E [ f q - 2 (X )] < 4 = ■ Therefore, 


E 


][[<y(deg G (■ Vi )) 


,i=i 


P is a path 


< exp 


3d£ 2 \ /3(1 — /3)\ e 1 


n 




,-4 / S #' eXp| -“° g 


d' <j—4 

q-4 


3d(l — (3) J n 


+ 


3 dl 2 


Since l = o(y / re), 


E [£s(v,i)\ < exp £ log 


q~ 4 


+ o(l) 


Zd(\-p) t 

Then the lemma follows from the Markov inequality and the union bound. 


□ 


6.2 Locally sparse for random graphs 

Lemma 30. Let e > 0 be some fixed constant. Let d be a sufficiently large number, q > (2 + e)d 
and 0 < (3 < l be constants. Let G = (V, E ) ~ G(n, d/n). There exists a constant C > 0 such that 
with probability 1 — O ( d ), for every path P in G of length £, \B(P)\ < C(£ + logn). 

Given P = (vi,..., vl), we are going to upper bound the probability 

Pr [|B(P)| > t | P is a path] (3) 

for every t > 0. 

A vertex v is a high-degree vertex if deg G (u) > — 2. Thus the probability (3) is maximized 

when (3 = 0. Note that conditioning on P is a path gives each vertex at most two degrees, we can 
redefine the notion of “high-degree” as deg G (v) > q — 5 and drop the condition that P is a path. 
Thus it is sufficient to upper bound 

Pr[|B(P)|>2] 

with our new definition of high-degree vertices. 

Let G = (V, E) be a graph. We now describe a BFS procedure to generate B*(P) := B(P ) U 
dB(P). Since B*(P) is always a superset of B(P), it is sufficient to bound Pr [|P*(P)| > t]. For a 
vertex v € V, we use Ng(v ) to denote the set of neighbors of v in G. 

Initially, we have a counter i = 0, a graph Go = G, a set of active vertices Ao = {vi,V 2 , ■ • ■, vl} 
and a set of used vertices Uq = 0. 
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(PI) 


1. Increase the counter i by one. 

2. (If i < L) Define Gi(Vi,Ei) = Gj_i [K-i \ {d}]- Let U % = A -1 U {vi}. Let A = (A -1 U 
NGi-Avi)) \Ui- Goto 1. 

3. (If i > L) Terminate if A-i = 0- Otherwise, let u E A-i and let lA t = lli-\ U {rt}. 

(a) (If \N G (u)\ > q- 5) Define G*(A Ei) = G t -iAi\{«}]. Let A = (A-iUiV G ._ 1 (u i ))\Z4. 
Goto 1. 

(b) (If |IVg(w)| < q — 5) Define Gi = Gi-\. Let A = A-i \ Ui- Goto 1 
The following proposition is immediate: 

Proposition 31. Assume the algorithm terminates at step t, then B*(P) = Ut -1 and, |L>*(P)| = 
t- 1 

Let R = {ri,r 2 ,... ,ul} be a set and each r* is the root of tree T t . We now describe a BFS 
procedure to explore these L trees. For a vertex v, we use C(y) to denote its children. 

Initially, we have a counter i = 0 and a set of active vertices Bo = R. 

(P2) 

1. Increase the counter i by one. 

2. (If i < L) Let B-i = U C(rt )) \ {rt}. Goto 1. 

3. (If i > L) Terminate if = 0. Otherwise, let w € £>,_i 

(a) (If |G(AI > Let = (^_i U C(u)) \ {u>}. Goto 1. 

(b) (If |C(w)| < 2=5) Let 5* = Bi_i \ {u>}. Goto 1. 

Now assume G ~ Q{n,d/n ) and for every i E [L ], T t is a branching process with distribution 
Bin (n,d/n), i.e., each C(u) ~ Bin ([n],d/n). We can implement the (PI) when at each step i, the 
vertex u chosen from the active set sample its neighbors Nq z _ 1 (u) according to Bin (Vi,d/n). This 
random process can be coupled with Q(n,d/n ) such that B*(P) found by it is always a superset of 
the one in Q(n,d/n). 

We now construct a coupling of (PI) and (P2) with the property that the later one always 
terminates no earlier than the former one. 

At each step i > 1, let u and w be the vertex chosen from A and B t respectively (u = Vi and 
w = ri if i < L). Then | Nc t _ -iO)| r-j Bin(|p| ,d/n). We couple it with some x ~ Bin (n,d/n) with 
the property that x > \Ng,^i ( v i)\ and let C(w) be a set with x elements. 

Lemma 32. For every i >0, the following two properties hold: 

(11) There exists a surjective mapping F) from Bj to A in each step i. 

(12) For every u € A? we use ni{u) to denote the number of w E Bi such that Fi(w ) = u. Then 
for every uE Ai, ni(u ) > |iV G (tt)| - |IV G .(«)|. 
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Proof. We apply induction on i to prove the lemma. 

When i = 0, we let Fq : Bq —> Ao be the function that Fq( rj ) = Vj for every j € [L\. Then both 
properties hold trivially. 

Assume the lemma holds for smaller i. If i < L, since by our coupling, \C{ri)\ > |-/V Gi _ 1 (uj)|, we 
can construct F t by extending Iq_i with an arbitrary surjective mapping from C{rf) to Nc i _ 1 (vi). 
For every v! G A, if u' € Nc^Vi), then n^u 1 ) > n i _i(u / ) + 1 and |iV G ._ 1 ( u )| - \N Gi {u')\ = 1; 
otherwise rii(u f ) = rii-i(u f ) and |jV Gi _j («)| = I N Gi Induction hypothesis implies both (il) 

and (z2) hold. 

If i > L , we have to distinguish between cases: 

• (If |IV G (tt)| > q — 5 and Nc i _ 1 (u) > ^Fr) We construct F t by extending Fj_i with an arbitrary 
surjective mapping from C(w) to Nq 1 _ 1 (u), the same argument as i < L case proves (fl) and 
(* 2 ). 

• (If |A r G (w)| > q — 5 and Nq i _ 1 (u) < In this case, by induction hypothesis, we know that 

ni-i(u) > \N g (u)\ - \N Ci ,(a)| > > NGt-Av). 

Choose a surjective / from F~\(u) to Nq._ 1 (ii) and construct F, L from F,;_\ by replacing the 
mapping on Fff \ (u) by /. This is safe since u A % . The same argument as before proves 
(il) and (■ i2 ). 

• (If |IV G (u)| < q — 5) Construct Fi = Fi-\. Since everything does not change, the induction 
hypothesis implies (il) and {i 2). 

□ 


The first property above guarantees that (P2) terminates no earlier than (PI) and thus its 
stopping time is an upper bound for the size of B*(P) found by (PI). 

(P2) can be modeled as follows: 

1. Let X ~ Bin(n, d/n) and X\, Xi ... be an infinite sequence of independent random variables 
defined as follows 


• For i = 1, 2,..., L, Xj is an independent copy of X; 

• For i > L, Xj has following distribution 


Xi = 



if X < (qr — 5)/2 
otherwise. 


2. Yi, I25 • • • is an infinite sequence of random variables that Yq = L and Y t = Y t _i + Xi — 1 for 
every i > 1. 

3. Z = min t = 0}. 

The above process is identical to (P2), thus we have 
Proposition 33. (P2) terminates after step t if and only if Z > t. 

Note that Z > t implies Yt > 0, we turn to bound the latter. 
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Lemma 34. There exist two constants C i, C 2 > 0 depending on d and e such that 

Pr [Y t > 0] < exp (-Ci t + C 2 L). 

Proof. By the definition, Y t+L = L — (t + L) + = — t + Yhi =1 -W + X^l+i We know 

the distribution of JQs and we now compute their moment generating function. For every s > 0, it 
holds that 

Pr [Y t+L > 0] = Pr [e sYt + L > l] < E [e sYt + L ] = e~ st (e [ e sX ]) L (e [e^+i])*. 

Recall that X ~ Bin(n,d/n), we have E [e s ' Y ] = (l + |(e s — 1))” < e d ( eS_1 ). Let p = (q — 5)/2, 
we have 


E [e sXi +i] = Pr [x < p \ + Y eSk ■ Pr [ x = k \ 

k=\p\ 


< 1 + Y eSk - Pr i X > k \ 

k=\p\ 


< exp Y eSk Pr [X > fc] 

,fc=[pj 


By Chernoff bound, for sufficiently large d, we have for some choices of s > 0 and C\ > 0, 


Y eSk • Pr [X>k]-s< -C[. 
fc= LpJ 


Let C' 2 = d(e s — 1), we have 


Pr [P i+ L > 0] < exp (-Cjf + C' 2 L) . 
This implies for some constants Ci,C 2 > 0, 

Pr [Y t > 0] < exp (~C\t + C 2 L ). 


□ 

Proof of Lemma 30. By Lemma 34 and the union bound, the probability that there exists a path 
P in G of length £ such that \B(P)\ > t is upper bounded by 

n ■ n e ■ ^ • Pr [Y t > 0 \<n-d l - exp (—C\t + C 2 £) = O ^ 

for t = C(£ + logn) and sufficiently large constant C. □ 
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